Multi-armed bandit

Results: 113



#Item
71Artificial intelligence / Detection theory / Game theory / Minimax / Binary tree / Multi-armed bandit / Regret / Statistics / Decision theory / Mathematics

Journal of Machine Learning Research[removed]1695 Submitted 1/10; Revised 11/10; Published 5/11 X -Armed Bandits S´ebastien Bubeck

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2011-06-28 04:19:16
72Multi-armed bandit / Stochastic optimization / Artificial intelligence / Game theory / Decision theory / Probability distribution / Minimax / Statistics / Mathematics / Machine learning

Pure Exploration in Multi-Armed Bandits Problems S´ebastien Bubeck1 , R´emi Munos1 , and Gilles Stoltz2,3 1 2

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2011-06-28 04:16:37
73Heuristics / Multi-armed bandit / Statistical hypothesis testing / Normal distribution / Statistics / Design of experiments / Heuristic function

Proceedings of the IEEE Conf. on Decision and Control, Maui, HI, 2012 Towards optimization of a human-inspired heuristic for solving explore-exploit problems Paul Reverdy1 , Robert C. Wilson2 , Philip Holmes1,3 and Naom

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2012-09-07 12:48:01
74Machine learning / Multi-armed bandit / Chernoff bound / Martingale / Concentration inequality / Statistics / Stochastic processes / Stochastic optimization

JMLR: Workshop and Conference Proceedings vol[removed]–23 The best of both worlds: stochastic and adversarial bandits S´ebastien Bubeck SBUBECK @ PRINCETON . EDU

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2012-08-21 11:04:32
75Stochastic optimization / Permutation / Statistics / Machine learning / Multi-armed bandit

Pure Exploration in Finitely–Armed and Continuous–Armed Bandits S´ebastien Bubeck∗ INRIA Lille – Nord Europe, SequeL project, 40 avenue Halley, 59650 Villeneuve d’Ascq, France

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2011-06-28 04:17:55
76Operations research / Machine learning / Analysis of algorithms / Computational complexity theory / Multi-armed bandit / Stochastic optimization / Time complexity / Algorithm / Regret / Theoretical computer science / Applied mathematics / Mathematics

JMLR: Workshop and Conference Proceedings[removed]–819 24th Annual Conference on Learning Theory A simple multi-armed bandit algorithm with optimal variation-bounded regret

Add to Reading List

Source URL: www.jmlr.org

Language: English - Date: 2012-01-02 12:01:12
77Stochastic optimization / Markov models / Artificial intelligence / Dynamic programming / Bandit / Markov decision process / Stochastic matrix / Game theory / Stochastic / Statistics / Machine learning / Multi-armed bandit

R Foundations and Trends in Machine Learning Vol. 5, No[removed]–122 c 2012 S. Bubeck and N. Cesa-Bianchi

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2012-12-16 05:54:25
78Science / Multi-armed bandit / Stochastic optimization / Game theory / Reinforcement learning / Determinacy / Structure / Statistics / Mathematics / Machine learning

Wisdom of the Crowds vs. Groupthink: Learning in Groups and in Isolation Conor Mayo-Wilson, Kevin Zollman, and David Danks November 30, 2010 Technical Report No. CMU-PHIL-188 Philosophy

Add to Reading List

Source URL: www.hss.cmu.edu

Language: English - Date: 2010-11-30 14:12:58
79Operations research / Stochastic processes / Dynamic programming / Combinatorial optimization / NP-complete problems / Knapsack problem / Multi-armed bandit / Martingale / Randomized rounding / Statistics / Theoretical computer science / Applied mathematics

Approximation Algorithms for Correlated Knaspacks and Non-Martingale Bandits Anupam Gupta∗ Ravishankar Krishnaswamy∗

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2011-02-14 10:17:58
80Marketing / Auction theory / Auctioneering / Decision theory / Multi-armed bandit / Regret / Pricing strategies / Auction / Futures contract / Business / Statistics / Pricing

The Value of Knowing a Demand Curve: Bounds on Regret for On-line Posted-Price Auctions ∗ Robert Kleinberg †

Add to Reading List

Source URL: www.akamai.com

Language: English - Date: 2006-09-21 21:38:54
UPDATE